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ABSTRACT 

Cooperative assessment was investigated in a 
classroom setting, examining achievement outcomes, effects on 
motivation, and student perceptions of the cooperative assessment 
process. Eighty-four undergraduate psychology students participated 
in this nonequivalent control group study design. It was hypothesized 
that students taking tests using a cooperative assessment procedure 
would perform significantly better on a posttest of educational 
psychology course concepts than would students completing tests in a 
traditional format. Effects of the treatment on goal orientation, 
perceived ability, self-regulation, and depth of processing were 
examined. Analysis of covariance indicated that there were no 
significant differences between the groups on the posttest, and that 
the hypothesis was not supported. There were also no differences 
between groups on measures of goal orientation, perceived ability, 
and depth of processing. Student reactions to the cooperative 
assessment procedure were overwhelmingly positive. Students enj oyed 
taking tests in groups, and felt that they learned more through this 
process as they discussed and debated the responses to the test 
items. Two tables illustrate the discussion, and two appendixes 
provide supplemental information. (Contains 2A references.) 
(Author/SLD) 
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Absti'act 

Amongst the plethora of cooperative learning studies, several investigations of collaborative, 
cooperative, or group assessment have appeared. These studies have investigated cooperative 
assessment in laboratory conditions (Lambiotte, et al., 1987), and in classroom settings (Bilsky- 
Toma. 1993; Webb. 1993: McCown, 1992), and have examined the effects of cooperative 
assessment on learning (Lambiotte. et al.. 1987; Bilsky-Toma. 1993; Webb, 1993; R. R. 
McCown. personal communication. April 5, 1993) and group process on assessment outcomes 
(Webb. 1993). In this .study, we refined McCown's methodology and investigated cooperative 
assessment in a cla.ssroom setting, examining achievement outcomes, effects on motivation, and 
student perceptions of the cooperative assessment process. 

Eighty-four undergraduate educational psychology .students participated in this non- 
equivalent control group design .study. It was hypothesized that students taking tests using a 
cooperative assessment procedure would perform significandy better on a posttest of educational 
psychology course concepts than would .students completing tests in a traditional format. In 
addition, the effect of the treatment on goal orientation, perceived ability, self-regulation, and depth 
of proce.s.sing was examined. The cooperative assessment group completed exams individually and 
then in groups; student exam grades were a combination of individual and group exam .scores. The 
traditional asses.sment group took exams individually. 

Analysis of covaiiance indicated that there were no .significant differences between the groups 
on a posttest of educational psychology concepts, thus the hypothesis was not supported. There 
were also no differences between groups on measures of goal orientation, perceived ability, and 
depth of proce.s.sing, though there were .statistically significant differences on self-regulation. 
Student reactions to the cooperative a.ssessment process, however, were overwhelmingly positive. 
These data indicated that although there were no statistically .significant differences in achievement 
between ti-eatment groups, .students enjoyed taking tests in groups and felt that they learned more 
through this process as they discus.sed and debated the responses to test items. 
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Cooperative Assessment ^ 

Cooperative learning has been thoroughly documented as an effective learning tool and 
teaching strategy (Johnson & Johnson, 1989; Johnson, Johnson, & Smith, 1990; Slavin, 1991). 
Cooperative learning, students learning from and with each other, helps to develop social skills, 
establish professional working skills, builds a sense of community within a classroom, and 
enhances student achievement, self-esteem, and attitudes toward school at all grade levels, 
including in college classrooms (Johnson, Johnson, & Smith, 1990; Slavin, 1991; Wynne, 1983). 
One explanation for these effects is that cooperative learning techniques augment the extent to 
which content is actively processed by students and offers participants the opportunity for 
discussion and negotiation which may lead to higher-level reasoning and the development of 
strategies which facilitate these processes (Gabbert, Johnson, & Johnson, 1987; Johnson et al., 
1990; Nystrand, 1986). The social support which often arises from a cooperative task has been 
shown to aid students in persisting on a challenging task, reducing frustration, increasing 
autonomy, and contributing to academic and career aspirations (Gabbert, et al., 1987; Sarason, 
Sarason, & Under, 1983). . 

Cooperative techniques require students "to explain what they are learning to each otner, 
learn each other's point of view, give and receive help from classmates, and help each other dig 
below the superficial level of understancing the material they are learning" (Johnson, et al., 1990, 
p. 1 1). While such "digging" and sharing of viewpoints can and does occur during cooperative in- 
ciass practice activities which take place as part of instruction, it is also likely that cooperative 
learning groups will negotiate under test conditions. Many believe that this negotiation of 
understanding is essential for knowledge construction (Duffy & Bednar, 199 1 ; Kember & 
Murphy, 1990; Vygotsky, 1978). Brown (1989) argued that learning is "about the making of 
meaning, not just the receiving of it Thoughuulness is a constructive, not a passive, undertakmg 

^^Several studies have begun to look at the benefits of an extension of the cooperative learning 
process - cooperative assessment Lambiotte et al. (1987) found, under labora.tory conditions with 
college students, that cooperative test-taking led to increased quantity of recall in reading and 
comprehension. Singer (1991) looked at the efficacy of cooperative testing with jumor high 
students in a pre-algebra class. His study revealed an increase in test scores usmg cooperative test- 
taking, as well as an expressed preference by the students for this form of test taking. R.R. 
McCown (personal communication, April 5, 1993) reported simUar fmdings in an unpubhshed 
pilot study of cooperative assessment 

Other studies have investigated the effects of group or collaborative assessment on 
achievement and social process. Bilsky-Toma (1993) found that assigning grades based on group 
responses to an English quiz increased stiident motivation and the quantity of commumcation 
between students and teacher during the test This study also reported disadvantages to 
cooperative testing; such as group instability, noisy classrooms, "stifling" of academically sti-onger 
students, and weaker students riding on the coattaUs of sti-onger students dunng the group quizzes. 

Webb (1993) analyzed the relationship between achievement scores obtained during small- 
group assessment tasks and individual assessment tasks. In Webb's study, sttidents solved 
mathematics operations on decimal numbers in collaborative small groups for a 50-mmute class 
period. Two weeks later, following a review session, students examined a similar problem 
without collaborating with other students. Students consistently performed better m the group 
assessment despite prior ability whereas student performance on the individual assessment was 
accurately predicted by both ability and behavior within the small group setting. 

Proponents of authentic assessment have called for assessment to be more than an end 
product Rather, assessment should be part of the learning process (c. f., Shavelson, B^ter, & 
Pine 1992; Shepard, 1989; Wiggins, 1989). Many educators have had the expenence of retummg 
a test to the class and attempting to help students learn from their mistakes, only to find that many 
of the students are far less interested in learning ffom their mistakes than they are in trymg to 
rationalize their incorrect responses in the hopes J earning extra points. Cooperative ^sessment 
places the ownership for learning in the hands of the students and offers opportunities for the 
negotiation of understanding. This negotiation can be a valuable learning tool and can encourage 



ERIC 



4 



Cooperative Assessment ^ 

students to think about what they have learned and are learning as they discuss the assessment with 

their peers. . .,..,„ j. » i 

In cooperative assessment, students complete unit examinaUons mdividually and unmediateiy 
afterwards in small groups. Since cooperative assessment yields grades that are equal to or higher 
than an individual test score, never lower (Griffin, et al., 1994), a possible negative outcome of the 
procedure is that students may look upon the group assessment portion of the process as a way to 
earn "free" points and subsequently decrease the amount of emphasis they place on prep^g for 
an examination and monitoring their study strategies (self-regulation), and on the value of the 
assessment as a learning ta^ rather than an opportunity merely to earn more points (goal 
orientation). This possible side-effect must be investigated before any assumptions about the 
efficacy of cooperative assessment can be made. Pintrich and DeGroot (1990) included three 
components in a working definition of self-regulation: (a) metacognitive strategies used by 
students in planning and monitoring learning (Brown, Bransfprd, Campione, & Ferrara, 1983; 
Como, 1986); (b) control and management of effort for classroom tasks (Como, 1986); and (c) 
type of cognitive strategies students use in learning and remembering (Como & Mandmach, 1983). 
Learning goals have been described as goals which are pursued with an emphasis placed O" 
becoming more proficient at a task, while performance goals are typically pursued for the purpo^ 
of looking good in front of others (Elliott & Dweck, 1988). Greene and Miller (1993) developed 
the Motivation and Strategy Use Survey to examine self-regulation and goal orientation as weU as 
perceived ability and depth of processing. . -j , 

This study, then, investigated the effects of cooperative assessment on mdividual 
achievement, goal orientation, deptii of processing, self-regulation, and perceived abihty in an 
educational psychology class for preservice teachers. The design differs from that utilized by 
Lambiotte (1987) in that it occurs in a classroom context rather than a laboratory context, and is 
thus more ecologicaUy vaUd. It also differs from those utilized by Webb (1993) and Bilsky-Toma 
(1993) in tiiat students take tests first individually and then in groups to ensure mdi\adud 
accountabiUty as well as group rewards. We have employed the design utilized by Gnffm, et al., 
which was developed by R. R. McCown (personal communication, April 5, 1993) who tound 
positive effects on student grades and student attitudes using this design. 

Method 

SuhjecLs and Design , ^ ^ . , u i ♦ k„ fi«t 

Eighty-five students enrolled in four sections of educational psychology, taught by the tirst 
autiior, participated in tiie study. All students were education majors with a variety of areas of 
emphasis (e.g., early childhood, middle grades, physical education, etc.). There were 16 males 
and 31 females in the treattnent group, and 9 males and 29 females in the control group. 9 students 
in the experimental and 4 students in tiie control group were Black, while the remainder were 
White 

A non-equivalent control group design was utilized, with treatments randomly assigned to 
groups. Scores from a pretest (discussed below) and self-reported GPAs were obtained and 
analyzed to determine if groups differed significantiy in Uiese areas. The groups were not 
statistically different on GPA E d, 84) = 1.46; b = -23 though initial differences did exist on 
pretest performance, control M = 13.17, SD. = 2.50, treatment M = 1 1.91, SJH = 3.14, E(l, 84) - 
4.2; c =.043. Analysis of covariance was utilized in later analyses in an effort to provide some 
statistic^ control for these initial differences in pretest scores. . r t 

The independent variable was testing condition in which participants eiuier took tests tirst 
individually and then as part of a group (cooperative assessment) or completed tests on an 
individual basis only (individual assessment). Students in the treatment group could add up to ten 
points to tiieir individual grade if performance on the group assessment exceeded Uieir mdividuai 
performance, but would hot lose any r ints if individual performance surpassed group 
performance. Cooperative scores (i.e., tiie student's combined individual and group assessment 
scores) could not exceed group - sores. For example, suppose Group A obtained a 96 on their 
group exam. Student Al obtained an 82 on her individual exam, and is thus eUgible for au lU 
group points, bringing her cooperative assessment score to 92. Student A2 scored 88 on the 
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individual exam, and is thus eligible for only 8 group points so that she does not exceed the group 
score, giving her a 96 on her exam. A score of 97 was obtained by Student A3 on the individual 
assessment, so she does not receive any group points and she keeps her grade of 97. Using this 
scoring procedure, we attempted to provide for both individual accountability and group 
incentives. Since students could add only up to 10 points to their individual score, they had to do 
some advance preparation for the exam to obtain a passing grade. By allowing students to earn 
extra points to add to their individual grades, incentive to work together as part of a group was also 
established as it was in everyone's best interest to obtain the highest group score possible to 
maximize the likelihood that students would earn group points and thus increase their exam grade. 

The dependent variables were achievement, as measured on a posttest of educational 
psychology course content and perceived ability, goal orientation, and self-regulation, as measured 
by the Motivation and Strategy Use Survey (Greene and Miller, 1993). Survey data from the 
cooperative assessment group evaluating the cooperative assessment strategy and follow up 
interviews with representative students in the treatment condition were also collected. 



Materials , <. . i. 

Pretest The pretest consisted of 30 multiple choice items, 10 from each of the first three 
units of instruction for the introductory educational psychology course. The topics addressed on 
the pretest were operant conditioning and information processing (unit 1), observational leammg, 
motivation, and outcome decisions (unit 2), and instructional models, instructional tactics, and 
classroom management (unit 3). The items on the pretest were primarily application type multiple 
choice items, and all were matched to unit objectives to ensure content validity. 

Posttest . The posttest was comprised of the same 30 items which appeared on the pretest and 
addressed the fiist three units of instruction. The posttest was administered in three sections, 10 
questions on each of the last three unit exams. Thus, the last three exams were partially 
cumulative. The 10 cumulative items from each of the last three exams were added together to 
comprise the posttest, and student posttest score was defined as the number correct of these 30 
items. This partially cumulative approach to testing was designed to provide a measure of the 
testing condition effects on individual learning. Using this strategy, students completed test items 
from unit 1 on the individual portion of the test Next students in the cooperative assessment 
condition discussed the same items from unit 1 in their groups. Then, on the second unit test 
students completed the 10 posttest items from unit 1 (as well as the 30 items from unit 2) on an 
individual basis prior to discussing test 2 with their groups. The posttest scores were taken from 
the first adrainisti^tion of the unit 2, 3, and 4 exams to measure the impact of the tireatment on 
individual performance. . . . , j . 

Motivation and Strategv Use Survev. This 57 item survey was admmistered to students 
immediately prior to completing the fourth exam. One item was omitted from our analysis because 
many students found it ambiguous. This item read, "Attending class was only helpful when 
something important for the test was covered." 

Greene and Miller (1993) completed a factor analysis of the survey and found items loadmg 
on six factors. See Table 1 for a listing of the factors, the number of items per factor, and the 
internal consistency measures of each factor for the original Greene and Miller study as well as for 
our analysis. 

Table 1 

Factor (# items) Greene alp ha Qriffin alpha 

learning gOal (4) .86 .74 

performance goal (4) .58 .74 

perceived abiUty (8) .74 .69 

self-regulation (17) .87 .77 

deep processing (11) .83 .83 

shallow processing (12) .73 .74 



Cooperative Assessment " 

All items were measured on a five-point Likert scale, ranging from Strongly Disagree (1) to 
Strongly Agree (5) and reverse-scored items were entered into the analysis accordingly. 

Group Assessment Survey . A 12 item survey was designed to tap student opinions and 
perceptions of the cooperative assessment process. These twelve items examined group process 
and student perceptions of the testing situation (See Appendix A). 



Procedures . j • i 

On tiie first day of tiie quarter all students completed tiie 30 item pretest of educational 
psychology concepts. Students also completed a questionnaire requesting demographic 
information which was used to assign students to heterogeneous groups. 

Students in all classes were assigned to cooperative groups of 4-5 members, which were 
mixed on tiie basis of gender, race, ability, and major. In terms of abiUty, each group consisted of 
a student witii a high GPA (3.5-4.0), tv/o students witii average CPAs (2.5-3.49), and one student 
witii a low GPA (below 2.49). Males and Black students were divided among tiie groups, and, to 
the extent possible after mixing tiie groups on all otiier variables, each group contamed at least one 
student who was not an early childhood education major (tiie predominant major m all classes). 
Students in all classes completed otiier types of group activities as well as cooperative assessment 
Students worked in tiieir groups to complete in-class non-graded activities designed to facilitate 
acquisition of course concepts. Each group also worked to cooperatively complete a senes of 
article critique papers concerning issues in educational psychology. These papers were graded and 
all group members received tiie same grade on each of tiie papers (4-5 papers, dependmg on tiie 
number of people in tiie group). In addition, tiie students in tiie cooperative assessment group 
completed tiie second administration of each unit exara woikmg cooperatively witii tiieir group 

members. „ • c i. 

Witii tiie exception of tiie difference in testing procedures, all sections ot tiie course were 
taught in tiie same manner. The two control group sections took place, one section each, dunng 
tiie summer and fall quarters of 1994. The experimental group sections were taught dunng tiie 
winter quarter of 1995. Care was taken by tiie instiiictor to use similar examples, to complete tiie 
same activities, and to cover tiie same amount of material per period in all classes. There were 
occasions, however, on which one group discussed sometiiing to a different deptii or from a 
different an^e tiian anotiier group. Altiiough tiiis does not allow for strict conti^l of tiie teachmg 
conditions, more effort to deter tiiese discussions was not made because tiie instructor was 
unwilling to interfere witii tiie quality of tiie instruction in order to implement stnct expenmental 
controls. Class sessions were typically a mixture of lecture, discussion, generation of examples, 
examination of practical applications of course content, and group activities. 

At tiie end of each of tiie four units students completed a unit exammation. Each exam was 
comprised of 40 multiple choice questions (primarily higher level items) and 2 essays itenis. 
Students in tiie experimental group were allowed to leave tiie room after completing tiie individual 
portion of tiie exam and were insOiicted to return at a time designated by tiie instiiictor. Stiidents in 
tiie experimental group worked togetiier to complete tiie test a second time after rettimmg to class. 
Each group was permitted to submit only one set of responses per group, so stiidents were torced 
to come to consensus on the responses to the test items. 

Exams were returned to stiidents witiiin tivo class periods for all sections of tiie course. 
During tiie period in which exams were returned, tiie instructor placed an answer tey on tiie 
overhead projector so stiidents could check tiie accuracy of tiie machme scormg.^ Copies ot Uie 
exam were not distributed to students, but copies were available in tiie instinictor s office tor 
Stiidents' perusal. Very few stiidents in eitiier ti^atinent condition chose to pursue this option. 

Immediately prior to completing tiie final examination all stiidents completed the Motivation 
and Strategy Use Survey. Stiidents in tiie experimental group also completed tiie Group 
Assessment Survey and submitted it on tiie day of tiie final examination. Following tiie tinal 
examination, stiidents were informed tiiat tiiey had been participating in an experiment ti^roughout 
tiie quarter. They were briefly told of tiie purpose of tiie stiidy and assured tiiat individual date 
would be kept confidential. At tfiis point, stiidents were also told tiiat tiiey could request tiiat tiieir 
date be witiidrawn from tiie stiidy if tiiey chose not to participate. Stiidents could exercise tins 
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option immediately or after grades were reported, in case they feared retribution for withdrawing 
their data from the study. No student chose to withdraw his or her results from the data pool. 

Toward the end of the quarter in -which data were collected for the treatment group, eight 
students from the cooperative assessment condition were asked to come in and participate in 
individual interviews with the third investigator. The third investigator was selected to conduct the 
interviews, rather than the first, in case smdents had any concerns to voice about the procedure. 
We anticipated that students would be less likely to offer criticisms of cooperative assessment to 
the instructor of the class in which they were completing the cooperative assessment procedure. 
Stratified random sampling was used to select the students; stratifications were based on self- 
reported GPA using the same categories used to assign students to cooperative groups. ImUaUy, 
nine smdents agreed to complete interviews, but one smdent from the higher GPA group later was 
unable to attend the interview. The interviews were audiotape recorded (with the permission ot the 
student). Each student was asked the same series of questions, but responses were explored m 
more depth as needed. 

Results 

All data were analyzed at the alpha = .05 level. Posttest data were analyzed using ANCOVA, 
with pretest scores entered as a covariate. No significant differences were found between the 
groups on the posttest, control group M = 21.03, SD = 3.29, treatment group M = 21.49, SD = 
3 66 Ffl 80) = 2 18; o =.14. The test for an interaction between pretest and treatment also 
yielded statistically insignificant differences, E(l. 80) = 1.32; © = .25. The effect of pretest scores 
F(l, 80) = 4.31; ji = .041 and GPA was significant, F(l, 80) = 20.40; £=.000, mdicaUng a 
positive relationship between pretest scores and posttest performance and GPA and posttest 
performance. Thus, the hypothesis of this smdy was not supported. 

Motivation and Strateev U se Survev , .,. , c a ^ • r.f 

The items for each of the six factors were aggregated withm those factors and analysis ot 
variance was conducted to determine if differences between the groups existed on any of the six 
factors. The results of these analyses are presented in Table 2. 

Table 2 

Factor df ItxdM 

learning goal F(l, 82) = 1.24 .27 

performance goal F(l, 82) = .02 .89 

perceived abUity F(l, 82) = .65 .42 

self-regulation F(l, 82) = 6.54 .01 

Jeep processing F(l, 82) = .04 .85 

shallow processing F(l, 82) = .04 .85 

The groups were not different at any traditional levels of significance on any of the factors except 
self-regulation. For this factor, the control group exhibited a higher mean than the treatment 
group. 

S urvey 

Analysis of survey data indicated that stadent perceptions toward cooperative assessment 
were quite positive. Mean scores, modes, and standard deviations for the Group Assessment 
Survey are presented in Appendix A (note that items 3, 9 and 12 are scored in reverse). Tliese 
results indicate that students tended to discuss test items with group members, prepared about the 
same amount for exams in this course (despite the fact they would receive extra pomts from group 
coUaboration) as they did for exams in other courses, and felt that taking tests in groups was 
somewhat beneficial to their grades. Students also beUeved that they were learmng more about the 
course content by discussing the exams with their groups. 
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Interview .. , • t. l . e 

The last source of data to be presented, interview data, was compatible with the outcomes ot 
the survey. The questions asked during the interview fell into two major categories, affective and 
cognitive/metacoghitive outcomes. Students indicated positive responses to most questions in both 
categories. That is, they reported that they were interested in learning, they valued the task, and 
they were determined to find the best answers and best supporting rationales for these answers. 

In sum, then, the outcomes on the posttest do not support the hypothesis of this study. 
Student survey and interview responses to the cooperative assessment strategy, however, were 
overwhelmingly positive. 

Discussion 

Overview and Posttest Outcomes ... 

This study examined the effect of cooperative assessment on the acquisition ot educational 
psychology course content It was predicted that students in the cooperative assessment group 
would outperform students in the individual assessment group on a posttest of educational 
psychology. This hypothesis was not supported. Certainly the first explanation to be considered 
regarding this finding is that cooperative assessment does not improve learning over an mdividual 
assessment format, as the results seem to indicate. Student perceptions of the procedure, however, 
appear to contradict this finding. As will be discussed ih greater detail, students reacted positively 
to the treatment and felt that they learned more from this method of testing than from traditional 
methods. Still, it must be noted, student perceptions are not always accurate mdicators of 
achievement gains. 

SurveY • • 

The results of the survey administared at the end of the quarter indicated a generally positive 
regard by students for the cooperative assessment procedure. Survey items used a 7-point Likert 
scale with higher numbers indicating more positive responses except for the reverse scored items, 
3, 9, and 12 (see Appendix A). Three major areas were emphasized in the survey questions: 
group testing process, study habits, and benefits of cooperative assessment 

Items 2, 4, 9, and 10 addressed the processes through which the groups went as they 
completed tests together. Note that item 9 is reverse scored, so that the lower mean corresponds 
with positive responses on items 2, 4, and 10. The survey indicates that students did tend to 
discuss the responses to test items more than they tended to rely on one person to provide a correct 
response and that all group members offered input to the discussion of mdividual items. Thus, it 
appears that students negotiated the responses to test items and attempted to reach a mutual 
understanding about the content This finding is consistent with the results of the mterview which 
indicate that for some items there was group discussion of the correct response, but for other 
items, group consensus, rather than discussion, was the norm. Observations of the group testmg 
phase by the iiistructor confirm that students did not discuss all items, but primarily discussed 
those items for which group members had selected a variety of responses. For these items, 
however, lively debates often ensued with different members trying to convmce others ot the 
accuracy of their individual response. . ui * 

Preparation for unit exams was addressed in items 1, 5, and 6. It seems qmte reasonable that 
students might study less for an exam on which they have the opportunity to improve their grades 
through group testing than in a traditional testing situation in which the irade they earn is solely tne 
result of their own efforts. This possibiUty was, in fact, what prompted us to allow the group 
exam to count only a maximum of 10 points toward a student's exam grade. If students can earn 
10 points, there is incentive to work with the group to earn the highest grade possible. lUere is 
also incentive, however, to prepare for the individual test since one must do reasonably weU in 
order to pass the exam even if all 10 points were earned. Students indicated that they and other 
members of their group were weU-prepared for the exams (item 1). Survey results (item 5) 
indicate that students studied about the same amount as they usually do, despite knowmg they had 
a group exam to complete. Of slightly more consequence to the students preparation was the 
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difficulty level of the exams (item 6); students indicated that they prepared somewhat more than 
usual due to the difficulty of the exams. _ . j * • 

Last, the survey examined the potential benefits of the cooperative assessment method. As m 
interviews, students focused on both the benefits of cooperative assessment to their course grades 
as well as to their understanding of course concepts. Note that items 8 and 11 (mtrinsic, learning 
goal) have higher means and lower standard deviations than item 7 (extrinsic goal). Question 12 
also looked at students' goal orientation, and ?eems to indicate that students leaned more toward a 
learning goal orientation. Note that the lower mean on this reverse scored item indicates that 
students were concerned with receivii.\g feedback from the group assessment, but were somewhat 
more concerned with the benefits of leaniing from their group. Overall, then, students believed 
that the cooperative assessment procedure did benefit their understanding of course matenal. This 
finding is another plus for using cooperative assessment - students perceive they are learning 
through this method as they work in groups to complete testing tasks. Group discussion of exams 
takes the responsibility for dispensing knowledge to the students out of the hands of the teacher 
and places it into the hands and minds of the students, encouraging them to be responsible for theur 
own acquisition and construction of knowledge. 

Interview „^ . / i • i 

The interviews focused primarily on issues related to affecuve outcomes (goals, social 
interactions of group members, self-efficacy) and cognitive/metacognitive outcomes (P^^"^8' 
monitoring and regulating group testing, evaluatmg, form of discussion) (^ Appendix B). We 
will first discuss the results pertaming to the affective outoome questions, followed by responses to 
cognition/metacognition questions. • j u- • 

When asked what their goal was during the group test, all students mentioned actuevmg a 
better grade or getting more points on the test This seems to indicate that stu(tents were extemaUy 
motivated. However, stiidents also indicated that they wanted to learn and understand the 
information on the exams, indicating some degree of mtemal motivation as weU. They valued the 
cooperative assessment task as being worihwhile and wanted each person in the group to 
participate. Stiidents mdicated that they evaluated each others' responses as possible ways .o 
understand tiie information. Perhaps tiien, stiidents were motivated to participate m group 
discussions by both intern^ and external forces. 

The procedure used by some of tiie groups to decide which items to discuss seemed to 
preclude many (perhaps too many) items from being debated and discussed. In some groups, it 
there was no disagreement about tiie correct response to an item, tiie item was not even read aloud. 
This appeared to Umit tiie number of items, and tiierefore tiie amount of content, discu^d. 
Perhaps, tiien, problems did not Ue in tiie level or quaUty of tiie discourse, but tiie breadtii of tiie 
discourse. As for tiie items which were discussed, we know littie about tiiem. Students were not 
asked to indicate which items tiiey discussed nor to what extent tiiey were discussed. Items may 
have been discussed because tiiey were vague or ambiguous, not because tiiey were particularly 
important Perhaps, tiien, tiiese items constrained students' attention to details or unimportant 
information. If students found it necessary to debate all test items maybe tiie strengtii of tiie 
cooperative assessment procedure would be increased. a * a- a 

Student self-efficacy was not challenged as much as it could have been. Some stiidents aia 
not feel completely challenged because tiiey did not have to discuss every item; tiiey could skip 
many items. Most stiidents said tiiat tiiey were challenged when they were debating and discussmg 
items, particularly if tiiey were certain tiieir response was correct and were tiymg to detena mat 
response. Some stiidents also mentioned tiiat tiiey did not have to risk much to participatein tins 
exercise; after all, tiieir individual grade could not go down, it could only be improved, l ftus, 
tiiere is littie to lose if tiie group does not improve. Perhaps self-efficacy could be better challenged 
(and subsequentiy improved) if tiie frequency of tile discussions mcreased. 

Cooperative assessment did not seem to change students' regular test preparation tactics. 
That is tiiey did not decrease tiieir amount of studying as a function of knowmg they could rely on 
tiie group for extra points and increased grades. They also did not however, modify tiieir 
stiidying to accommodate tiie different testing procedures required by tiie group test. 1 hat is, uiey 
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assumed that their regular test preparation tactics were appropriate for group testing. It is possible 
that their regular strategies were perfectiy adequate for cooperative assessment It is also possible, 
however, that smdents could have benefited more from the procedure if they had modified their 
study habits to accommodate group discussion as a function of cooperative assessment 

Students perceived the cooperative assessment activity as a problem solving task. They 
focused on explaining why their answers were correct or incorrect and tried to clear up confusing 
issues. They also evaluated each other's reasoning about information and tried to determine the 
best answers and rationales for the answers. Furthermore, tiiey discussed different perspectives 
on particular items and valued comparing different viewpoints. Students indicated that they were 
actively monitoring tiieir peers' discussion when there was disa^ment especially when there 
was an "even split" with equal numbers of group members arguing /or two different responses. 
Many diverse examples were generated during group discussion, indicating tiiat smdents were 
clarifying and re-examining tiieir knowledge of tiie information tiirough interaction. However, all 
of this discourse activity was limited to the few items which were actually discussed. The question 
remains, then, how can smdents be encouraged to discuss all of tiie test items? 

One possible, and obvious, solution to this problem is to have smdents work on the tests as a 
group from the outset - that is, eliminate tiie individual portion of tiie assessment While this might 
encourage more discussion, it also eliminates tiie provision made for mdividual accountabiUty and 
may encourage more "ftee-riders." Anotiier possible solution is to generate test items of a different 
type and/or difficult enough across all items to encourage more discussion. A ti^ird potential 
solution, and one which will be investigated in future studies, is to generate different items for tiie 
individual and group assessments. By taking tius measure, individual accountability could be 
retained while stodents would be more likely to, minimally, read all items aloud, and, optimally, 
discuss all of tiie items because they had not seen any of them previously. 
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Appendix A 

MeanSy Modes^ and Standard Deviations for the Group Assessment Survey 

Item Mean Mode SD 

1. To what extent was each group member 6.04 6 0.94 
prepared for the exams? 

(unprepared to very well prepared) 

2. To what extent did each group member 6.47 7 0.94 
participate and offer input as you completed 

the group exams? 

(did not participate or offer input to 
participated and offeied extensive input) 

3 . My group members seemed to study less and 1 .70* 1 1-26 
rely on the group*s success to raise their exam 

grades. 

4. In general our group discussed and debated 5.78* 7 1-68 
the answers to most of the test items. 

5. What effect did taking the group exam have 4.77 4 0.97 
on your study habits? 

G studied much less than usual to I studied 
much more than usual) 

6. What effect did the difjBculty level of the 5.30 6 1.08 
exams have on your study habits? 

(I studied much less than usual to I studied 
much more thai^ usual) 

7. Taking the tests as part of a group was bene- 6.10* 7 1-19 
ficialtomy grade. 

8. Taking the tests as part of a group was bene- 6.42* 7 0.77 
ficial to my understandmg of requtrcd course 

concepts. 

9. In goieral, our group did not debate the 2.33* 1 1-93 
responses to most items, but relied upon one 

or two members to provide the answers. 

10. The group as a whole woriced together to 4.94* 6, 7 1 .9 1 
complete all of the essay items. 

11. As I completed the group exam, I gamed a 6.50* 7 0.83 
better understanding of content I missed when 

I took the test individually. 
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12. As I completed the group exam, I was more 3.00* 4 1.41 

concerned with detennining which items I 
answered correctly on my own than with 
learning from the group. 

* All items used a 7-point Likert scale, with higher numbers indicating more positive responses with the exception 
of items 3, 9, and 12 in which case the lower numbers indicate a more positive response. End points of the 
Likert scale for aH items (unless otherwise indicated foUowing the item) were "Strongly disagree" (1) to 
"Strongly agree" (7), 
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Appendix B 

Interview Questions and Response Summaries 

L Affective/Process Outcomes 
A. Goal 

Q: What was the purpose of the peer-group test? 

1. Problem solving , 

- to explain why answers are correct or mcorrect (clear up confusmg pomts or 

wrong answers) 

- discuss reasoning behind answers 

2. Multiple views are highlighted 

- to see others' perspectives, others' thinking 

3. Benefits of group discussion 

- discussion should improve learning, memory, understanding 

Social/Behavior al Interactions of Group Members 

Q: Did everybody in the group do about the same amount of talking or did one particular person 
dominate the discussion? Was it a pleasant experience? 

1. Amount of talking . r u ,u ♦ 

- students who knev/ more or had more experience talked more, but most leit mat 

all students had equal power 

2. Pleasant experience 

- pleasant experience to discuss the test with peers 

Q: How did you respond to each other's comments during the group testing session? Can you 
give me an example of your response to someone in the group? 

Students typically . , . 

- engaged in positive exchanges (equity, respect, fairness, acceptance, no nostuity; 

- tried to understand the reason behind peers' responses to items 

- tried to evaluate the best answer 

C. Self-efficacv 

Q: Was die group test a difficult chaUenge for you and for your group? Why or why not? 

- students apparentiy did not feel challenged except when a lot of disagreement occurs, then 
they feel challenged to resolve the disagreement 

n. Cog nitivpyMetacognitive Outcomes 

A. Planning 

Q: Did you do anytiiing special to prepare for the peer-group testing activity outside of class? If 
so, what? 

1. Study strategies , , ^ / • ♦ * 

. Group assessment did not change students' typical meUiods (strategies) tor test 
preparation* Ratfier, tfiey tried to apply tiieir usual study techniques to fir 
the group discussion activity. 

2, Level of preparation 

- some felt more responsible to be more prepared tiian usual because they were 

concerned about peer pressure 
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Q: What was your goal during the activity? What did you hope to accomplish? 

1. External goal 

- All students mentioned increasing grades, number of pomts, or score on test 

This seemed to be their primary goal. 

2. Internal goal , , , 

- 7 of 8 students also mentioned that they wanted to learn, to understand, to get 

other perspectives on the material, or to clarify misunderstandings 

R, lyinnitoring and Regulatmg 

Q: What was the most difficult part of the group test? 

1. Disparity of answers 

- difficulty occurred especially when there was great dispanty m group answers 

2. Consensus-seelong 

- trying to get everyone to agree which was difficult because either 

1) no one is sure of the content Gack of knowledge) 

2) each person is more likely to believe their answer is correct 

3) more persuasion had to be performed, which requires more effort 

Q: Did you spend you time on some particular items more than others? If so, why? 

1. Multiple choice items 

- particular items created more disagreement, taking more time to work through 

discussion and reach consensus 

2. Essay , . , . a- 

- required students to fust decide who responded to which item, to discuss 

individual answers, and to choose or construct best group response 

Q: Did you ever go back to items you akeady answered? Why? . i • u 

-students paid attention to and utilized test taking strategies to regulate ttieu- thinkmg, such as 
noting difficulty of item, elapsed time, illumination of previous response by later 
discussion 

Q: What did you do when you didn't know an answer or didn't know if your answer was correct? 
Various strategies used ^ 

- went witii best explanation or example, deciding tius person must know answer 

- temporarily sldp the item and come back to it 

- went witii majority opinion 

- went with "expert** opinion (rarely done) 

- guessed (rarely done) 

Q: When you were taking tiie group test, how did you decide on tiie best answers to tiie multiple 
choice items? Give an example, ^ ^ u ^ ^« 

•First, groups would ask for each person's response to an item- Procedures vaned based on 

consensus or lack of consensus. 
L Consensus 

- If all agreed, tiien go on to next item witii httie or no discussion 
2 Lack of consensus 

- some disagreement, but majority agree then groups discussed a little bit but often 

opted for majority view 

- major disagreement, with equal representation of two responses lead to much 

discussion and a search for the best explanation or justification, followed by 
a decision on correct response 
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Q: When you were taking the group tesu how did you decide on the answers to the essay? Give 
an example, 

•First students deternuned which essay question was answered most frequenUy. 

1. All or majority answered same question 

- each reported and explained Iheir answer, then group discussed the answers and 

selected the best response to use as the group response 

2. If different questions were answered 

- group decided on one to respond to, but some people were left out ot the response 

generation 

C Evaluating Activity 

Q: What did you learn in the group testing process? 
Process helped to clarify information 

- generated more and diverse examples of concepts from different perspectives 

- remembered teacher's or textbook's examples (review) 

- clarified relationships among concepts 

- applied concepts rather than merely memorizing definitions 
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